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ABSTRACT 

This paper on Thesauri in Educational Documentation 
attempts to show by description and analysis the difference in 
application between thesauri and classification schemes as indexing 
languages in documentary information systems, and from this to infer 
possible future developments. First/ the differences between the 
pre^coordination and post*coordination of terms are examined and a 
distinction is made between the library and documentary systems for 
indexing literature. Special attention has been paid to the manner of 
written and systematic presentation of index languages. Next, short, 
analytic descriptions of a number of existing thesauri or thesauri in 
preparation serve to introduce the problem of specificity versus 
universality in classification systems and thesauri. The Eudised 
Thesaurus is placed side by side with others for this purpose. From 
the comparison of methods of construction used by the various 
thesauri, the conclusion is drawn that an increasing integration 
between classification systems and thesauri niay be expected. 
(Author) 
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SYNOPSIS 



This paper on Thesauri in Educational Documentation attempts to show by description and 
analysis the difference in application between thesauri and classification schemes as indexing 
languages in documentary information systems, and from this to infer possible future developments « 

Classification systems and thesauri have not been considered from a theoretical or 
philosophical point of view, but an attempt has been made to find a documentary approach by 
studying both thesauri and classification systems in the actual forms in which they are presented. 

Firstly the differences between the pre-coordination and post-coordination of terrr.s are 
examined and a distinction made between the library and documentary systems for indexing 
literature. The importance of an index language as a controlled language system is subsequently 
explained . 

In addition to the differences in the application of classification schemes and thesauri as 
index languages, special attention has been paid to the manner of written and systematic presentation 
of index languages. This stems partly from the history of how the documentary thesaurus came into 
existence . Its construction and arrangement provide Valuable material for the study of the methods 
of providing easy (visual) reference in the documentary thesaurus. 

Next, short, analytic descriptions of a number of existing thesauri or thesauri in preparation 
serve to introduce the problem of specificity versus universality in classificaiiun systems and 
thesauri, 

The Eudised Thesaurus is placed side by side with others for this purpose. From the 
comparison of methods of contruction used by the various thesauri, the conclusion is drawn that 
an increasing integration between classification systems and thesauri may be expected. 

INTRODUGTiON 

The thesaurus has made its appearance in libraries and documentation centres and in all 
probability is there to stay. As is the case with a number of other modern phenomena, the 
appearance of the thesaurus was not merely accidental. 

The speed with which the thesaurus has come to occupy an important place in documentation 
systems is such as to demonstrate that the thesaurus cannot be regarded as a passing phenomenon, a 
flash-in-the*pan. On the contrary, it must be seen as a new and efficient aid, the logical result 
of the more practical approach, which includes post*coordination and the computer, adopted by 
libraries and documentation centres. 

If an attempt is to be made to describe the differences in practice between the new aid and 
classification systems in order to get a clearer picture of the thesaurus' place among the other 
instruments, it is necessary first to go into some basic principles. 

This paper is not an attempt to consider thesauri arid classification systems from a 
theoretical or philosophical angle. An effort has been made to arrive at a pcactlcal, documentary 
approach by looking at the thesauri and classifications as systcmatiscd presentations of indexing 
languages designed to facilitate the use of literature data in a data retrieval system. 

There are two very important ways of approaching the arrangement of data in a retrieval 
system. It is essential to consider in some detail the difference between pre-coordination and 
post-ccordination. 



PRE- AND POST-COORDINATION IN DOCUMENTARY INFORMATION 



In the concepu of pre- and post-coordination the term •cocfdinatiorf is used to mean 
•correlated arrangement*. In documentary information the term pre-coordlnation is often used 
to mean 'arranged according to a permanent or virtually constant correlation* . 

The term post^coordination is used to indicate that a desired correlation has not been 
introduced beforehand but comes into being at whatever moment there proves to be a need for 
it. There are two Important applications in respect to documentary information, viz, ine 
coordination of words and of descriptors. 

In coordinating words, only pre-coordinatlon is possible. This may be teferred to as 
linguistic pre-coordination . When compound words are made, new words come into being 
whose component parts lose some or all of their original meaning and the pre -coordinated word 
acquires an independent meaning. 

Pre-coordination of words plays an important part in determining and constructing 
descriptors. By descriptor is meant a concept In the form of a word, phrase or notation deliberately 
and explicitly chosen to be used as: 

- a representative of the content of the document in a retrieval system 
(storage side); 

- a search feature for retrieving literature data from a retrieval system 
(retrieval side). 

A descriptor may thus be either a single word or a combination of pre -coordinated words. 
Once it has been designated as a descriptor the compound word ac<;uires a permanent structure. 

If a number of different deicriptors together indicate the contents of a document, these 
descriptors may be stored in a retrieval system and retrieved by me^ns of pre-coordinatlon and 
post* coord ina tio n . 

Thefe methods will be referred to from now on as the Library Method and the Documentary 
Method respectively. 

The Library Method 

A characteristic feature of the conventional library method for Indexing literature is that^ 
almost without exception, pre-coordination Is employed. 

P» Otlet(6) explains it in the following termsi "On entend par classement bibliographique 
Part de disposer les ouvrages d*aprb leur mati^re (sujet ou contenu) et pat classification. La 
classification bibliographique est une otdre de suite , Elle se d^veloppc en une sdrie Undalre unique, 
dont tous les termes occupent. les tin par rapport aux autres, une place ou range d^Ign^ par uue 
signe(niot$, nombtes, ousymboles, quelconques ordonn^ en systime)**. 

There Is therefore an inherent difficulty In the system which arises when pre-coordlnated 
catalogues are consulted. The tiset needs to be fully Informed of the classification system 
employed by the cataloguer. To some extent he needs to be a mindreader. 

One natural cc mpensatlon for the rigidity of pre-cootdinatbn is cross*refetencIng or the 
introduction of any desired permutations. The {deviously established order of the descriptors Is 
upset. However, it will not often happen that all possible permutations are used. 



The Documentary Method 



The characteristic feature of this method U the use of post-coordination. The catalogue 
has almost disappeared and the file or data base has taken its place. 

There are three different ways of putting descriptors on file : 

- The mono "reference method to one docurnent 

(Unit card systerrJ.A surrogate document is made (a card or fiche) per 
descriptor for each document, 

- The m ulti*ref erence method to one document 

(Edge-punched card system). All the descriptors assigned to one document 
are entered on the informauon carrier (card etc.) but in such a way that each 
may be retrieved independently, 

• Th e mono-reference method to a number of documents 
(Superlmposable punch card system). In this the descriptors assigned to all the 

documents in a given collection form separate sub-collections with reference 
to the documents to which they were assigned; a so-called Inverted file. 

INDIXING LANGUAGES IN DOCUMENTARY INFORMATION 

By an indexing language in documentary information is meant the aid which is used in a 
retrieval system co identify the query and the offer of documentary data. An indexing language 
differs from a natural language in that It makes use of only a limited number of words or concepts. 

In most retrieval systems the original document is represented by a surrogate which gives 
the content of the relevant document in compressed form. For the purpose of compressing the 
information an indexing language is used which aho expresses systematic and associative 
correlations . 

There are three main reasons for using an indexing language in retrieval systems^ which will 
serve to explain why this method is preferred . 

- Reduction in the assignment of the number of concepts to be used j 

* Standacisation of the permitted desciiptots \ 

- Greater probability of recall. 

If an indexing language is presented In writing, showing primarily the systematic connection 
between the concepts, it is called a classification or code, The natural complement to a 
classification system or code is an alphabetic list. 

If the primary reference to concepts is alphabetical £iid if in addition the concepts ate 
cross-referenced^ the indexing language is called a documentary thesaurus. As one would expect, 
the natural complement to the thesaurus is the arrangement of descriptors according to theii 
systematic Correlation. 

A distinction is made, when speaking of indexing languages, between systems with a 
controlled language and those with a free language. In free language systems there are no 
constraints in the choice of words to be used in the system. 
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But this disHnciion fs not an entirely exact one. Even In free language systems the 
indexing language (aid to the identification of the query or offer of data) is in fact built up 
in the system. The thesaurus is therefore no longer an independent unit outside the system. 

The most striking feature of classification is the hierarchy. Classifications bring 
atrractive advantages to the pce-coordination of descriptors. There is a good deal of similarity 
between the rules of the hierarchy (whether it is a mono- or poly-hierarchy) and the exclusivity 
of the linear sequence of library classification. The strict genetic structure of the latter has 
not proved to be entirely satisfactory and has led to the development of facet classification. 
Classification may^ for that matter, be applied equally efficiently to systems using post- 
coordination . The thesaurus proves to be an Ideal aid in post-coordinating descriptors. The 
chief reason for this Ls probably to be found in the fact that post-coordination does not attach 
exclusive importance to hierarchic relationships. Every relationship, whetler hierarchic, 
associative or linguistic may be Important in forming a combination of desctlptois at the 
desired time . Although not so obviously suitable, thesauri can also be applied to systems using 
pre-coordlnaiion. 

The differences and similarities in the application of classifications and thesa^iri are 
explained more fully In the following pages, with special reference to the arrangement of the 
thesaurus. Firstly, however, a word about how the documentary thesaurus came into existence 
will not be amiss. 

FRO M ENCYCIX)PA£DIC TO DOCUMENTARY THESAURUS 

The word thesaurus is an ancient one, which in both Greek and Latin means '^treasure'* or 
"treasury". In the thirteenth century Btunetto Latinl (1220-1294) used It in the sense of an 
encyclopaedia. Later, in the sixteenth centuryi it was used to mean a dictionary. Members 
of the Estienne family of French printers wrote the Thesaurus Linguae Latlnae (second edition 1643) 
and the Thesaurus Graecae (1572). 

With the appearance in 1862 of the Thesaurus of English Words and Wirases compiled by 
Peter Mark Roget, a considerable change in the meaning of the word thesaurus came about. 

As we know, a dictionary is an alphabetical list of words, with their meanings, expressed 
by means of the Ideas the words are considered to reflect. Roget*s Thesaurus works In exactly the 
opposite way. He begins with the Idea or concept and adds a list of the words which r^iay suitably 
be used to express It. 

Roget*s intention was to produce a work which would be of help to writers In finding the 
right word (or phrase) to describe more or less abstract ideas. 

Roget's Thesaurus Is dlvid^d into three sectionst 

Part I - The Synopsis of Categories^ consisting of six classes further divided Into 

sub*classcs and sections j 
Part H - The actual thesaurus i 
Part ill • The alphabetical index. 

The remarkable feature Is the close Similarity between the classification and the thesaurus« 
ajfti the important role played by classification i 

Rogei*s wotk has had many imlutorsj similar publications have appeared in French^ 
Hungarian, Dutchi Swedish* Spanish and German; 

The development of the thesaurus idea» however, has also taken other directions. Three 
different types of thesaurus ate discussed below. 



The Lextcologtcal Thesaurus 



The grouping of the words In a language into Interrelated groups Is an extremely practical 
aid to the linguist. One of the best examples of this type of linguistic sfiidy Is F. Dornseiff's 
••l^er Deutsche Wortschatz nach Sachgpuppen**, published in 1832. 

The object of this work was to group the words of the German language according to their 
meaning. Dornseiff begins, as Rogei was later to do, with general ideas and then investigates as 
thoroughly as possible the different ways of expressing them in the German language. He lists 
ordinary conversational language and all derived forms such as underworld slang, popuUr slang, 
nursery language, etc. 

The work consists of a numbered classification, the main section arranged systematically, 
and an Incomplete alphabetical index. It also contains full linguistic and bibliographical 
references, Dornseiff is in fact Roget's predecessor. However, the typical •'feedback'* from phrase 
to idea, which Is so interesting in Roget, is only partly developed. But this was not Domseiff's 
main objective. 

The Conversion Thesaurus 

In a lexicological thesaurus there is hardly any feedback to words, while the conversion 
thesaurus reverses the process. One no longer begins with ideas, with lists of words attached, but 
with the words and expressions in common use which are referred back to key words. This method 
is applied particularly in the study of the possibilities of mechanical translation. 

In 1957 (about a century later than Roget) H.P. Luhn developed a conversion thesaurus with 
the aid of word frequency counts from a given number of documents. He groups synonyms and 
near-synouyms in families (geni) and each genus Is indicated by one keyword. His object is to 
develop an aid to the mechanical abstraction of texts. 

The characteristic features of the conversion thesaurus are therefore the referring back of 
words or expressions to keywords from a controlled language, ^nd the absence of any cUssificatipn . 

The Documentary Thesaurus 

The lexicological and conversion thesauri were developed chiefly foi use In the study of 
linguistic problems. The conversion thesaurus, however, began to enter the field of librarlanship 
and documentation, A further development was to take place In this field, 

Ii; 1957, C,L. Befnlei %nd K.F. Heumann published an article (2) entitled ""Correlative 
Indexes". In It they use the term *theiautus* to mean a complex network of words with mote or 
less abstract meanings. The least abstract words, ex^Tessing thing;s which can be weighed, measured 
or counted, are used as basic expressions which must be related to as many abstract Ideas as possible, 

In 1958 T. Joyce and R,M, Needham (4) linked the thesaurus directly With documentary 
information In their article entitled t '*The Thesaurus Approach to Information Retrieval". It was 
wobably Vickery ( 10) who In 1960 gave a name to the new type of thesaurus and put two of Its 
distinguishing features into wordst 

? » The documentary thesaurus assists in Infoimatlon storage and retrieval by converting 
wordi from everyday language into descriptors I 

2. The documentary thesaurus is also a summary of the relationships between the 
descriptors, 
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It is now possible to define the documentary thesaurus a$ follows t 



A (documenuty) thesaurus is a fixed or chosen list of descriptors and non-descripiors 
in alphabetical order for use in an information storage and retrieval systemi in 
which hierarchic^ associative or linguistic relationships between the descriptors 
can be shown. 

ARRANGEMENT OF A THESAURUS (7) 

Descriptors are chosen or fixed . There must therefore always be an authority to select, 
1imir» establish and make rules for the admission of new descriptors. 

The descriptors for a retrieval system can be selected In three ways: 

* By a process of deductfon» motivated by a desire for completeness | 

- By a process of Induction, for example from frequency counts based on a 
limited collection of documents ; 

- From experience, based on the more or less known usefulness of descriptors. 
The limitations of a thesaurus may consist in its being restricted to; 

- One subject or branch of science ; 

- One organisation or Institution ; 

- one collection of documents. 



Linguistic forms play their part in the fixing of descriptors. A dJijcrlptor may be made up of 
one single word or of a group of pre*coordinated words. The single word is called a unitermt the 
pre-ccordlnated descriptors arc called polyterms or multiterms (8)* 

There is a connection between the specificity and the degree of pre»coordination of a 
descriptor. Its usefulness will be optimal if on the one hand the frequency of use is not too high 
as a result of its meaning being too general and on the other hand not too low because it is too 
specific. The nature of the documents and the query situation of the users of the retrieval system 
both have some Influence on this. 

The arrangement of the thesams Itself is governed by the measure of search guidance it is 
desired to build into it. The measure of search guidance is determined by the measure In which 
the correlation between the descriptors is made known. A distinction must be made^ between natural 
and fixed relationships. 

Among the natural relationships are those between descriptors and non^escrlptors. Non* 
descriptors are words (or word combinations) which cannot be used as search features. Synonyms 
(different words with the same meaning) are considered as hon*descriptors and must be marked 
(•see*, or *use* .,,). Ouasi-synonyms (e.g. acronyms) are also non*deicrlptors and must be back- 
referenced » Homonyms (words of the same form as others but of different sense) may also be 
back-referenced » However the meaning Is often narrowed down by means of an identifier . 

Fixed relationships may be divided into two categories^ vertical and horizontdU Vertical 
relationships are those between genui and species, and horL2ontaU the associative relationships 
(•see aUo^4 




In a thesaurus it Is possible to Indicate relationships between descriptors in many way^ 
by means of symbols. The following list is In almost general use, however. 



Symbo l Standing for Meaning 

D Desctlptor The actual term used in search 

SN Scope note short explanation of the term 

UF Used for Summary of the non-descriptors 

(not for use in system) 

BT Broader term Genus relationship 

NT Narrower term Species relationship 

RT Related term Associative relationship 



Looking at the use of the thesaurus from the point of view of a search for literature, the 
network of the relationships between descriptors appears as a complex, built-in search guide , It 
is possible to arrange this complicated network In a variety of ways» 

SETTING OUT THE THESAURUS 

It is possible to show the correlation between descriptors in a thesaurus . There are two 
distinct methods: 

- By grouping descriptors according to their family relationshlpi 
• By diagrams of the relationships between descriptors. 

One purpose of making the thesaurus readable is to facilitate Its lexicological management. 
Another reason is to make it easier to find descriptors for the analysis of documents and the search 
for literature. 

This ••readability" makes it possible for the user to search systematically for an Idea, It 
rrtust be remembered, however, that the visual system Is subordinate to the primary, alphabetical 
use of the thesaurus and therefore can never be as valuable as a traditional classification system (3). 

Grouping in Desctlptor Families 

In composing descriptor families it must be remembered that each descriptor may belong to 
one family only. The usual method of grouping is as folbwss a scope note is worked out for each 
distinct family and a separate synopsis is given of all family scope notes. The descriptors belonging 
to each family are listed alphabetically. 

In the thesaurus proper, the main descriptors (i.e. the descriptors for which no BT is to be 
found in the thesaurus) are followed by a reference to their family relationship. This can be In the 
form of a noution (numbers and/or letters). The notatiort is the link between the alphabetical and 
the systematic paru of the thesaurus. In this way it has also become possible to determine the 
family relationships of a descriptor in the alphabetical section very quickly, 

Diagammatic rcptesentaHon of descrlpto t lelatlonships 

One of the fundamental problems In documentaty Information Is the tdePitlflcatlon of the 
foliowingt 

1» the ideas of the author of a document^ of the person processing the document and of 
the person requesting literature! with 
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2. the same ideas uansformed into concepts expressed in ordinary language^ 
and in turn, with 

3» the same concepts transformed from ordinary language into descriptors which 
are part of a controlled language system. 

Diagrams of the relations between descriptors can prove extremely helpful for the 
purpose of such identification. They can demonstrate relationships in a way which neither 
classifications nor thesauri can. 

The arrow and circle graphs good examples of diagrams showing descriptor 
relationships. 

Constructi n g arrow graphs 

Arrow graphs (8) are constructed by first listing all the terms used for a particular subject. 
They are subsequently grouped in a number of families of related terms. 

The terms belonging to one family are written down in such a way that each is positioned 
in accordance with its relationship to the other terms. 

Once the synonyms and quasi-synonyms have been eliminated, it Is possible to construct 
polygons around terms which are closely related. Then a nucleus term is shown inside each polygon 
and this term Is finally given the status of a descriptor, ^ 

If the sides of the polygoa are now replaced by arrows, an arrow graph Is formed, A single 
arrow is used to show an NT relationship and a double one for an RT relationship, (See Appendix I), 

Constructing circle graphs 

One disadvantage of the arrow graph Is that it is not always immediately obvious which is 
the NT and which the BT in a hierarchic relationship, The circle graph (9) provides a simple 
solution to this problem, It consists of concentric circles showing tlie hierarchic levels, (See 
Appendix II.) 

The BTs are found in the centre and derived NTs on the clrcumfererice. Arrows are also 
used in this type of graph, pointing from BT to NT. But beginning with the outer circle, the 
BT can be found via the NT, 

THE KUPISED THESAURUS AND OTHERS (13) 

If one regards classifications and thesauri as being confined to coatrolled language systems, 
they ate essentially Vocabularies whose scope Is limited. Their scope is largely determined by the 
purpose for which they ate designed. 

Classifications* and in particular the universal classifications, have yet another dimension, 
Ihey Claim to cover human knowledge In a particular field (the universal classifications cover the 
sum idtal of human knowledge), in a systematic, correlated form, A good example of the universal 
classification is the Universal i>ecimal Classification (UDC), 

Too close an apf^oach to universality may create difficulties In the direct use of 
classification in a controlled language system. The almost Inexhaustible possibilities of the UDC 
for consttucting detailed concepts by assigning fAcets to tfie main Concepts means that a second 
step is urgently needed . A choice will have to be made and a limited collection of concepts 
compiled, which may be used to the exclusion of all other possibllties. Only in this limited 
sense can the universal classification be called an index language « 



The question may now be asked to what extent thesauri, built up as they are from 
empiiicl5m» may be endowed with some measure of generality, of universality, It does not 
seem possible to glvo. a definite answer to this question as yet. A number of interesting and 
In this connection relevant forms of ihesauris have appeared which may perhaps provide some 
Indications as to how the question may be answered. 

The Macrothesaurus 

The OECD Macrothesaurus (15) is an aitempt to demonstrate that It is possible to create 
a common descriptor language. Its purpose is to coordinate the thesauri already In use by the 
individual organisations of the UN family* A link mechanism was designed for the various data 
bases which are each arranged in accordance with their own thesauri. The link mechanism does 
not contain all the descriptors, neither does It contain only those from all the linked retrieval 
systems. A selectkn hdd to be made In order to ensure ihatx 

- The descriptors would mean the same thing to all users ; 

- The descriptors would actually be used by a reasonable number of the users j 

If the Idea of a macrothesaurus is the right one, the structure would have to be such as to 
leave room for subjects not already included. 

It was decided In 1969 to compile the macrothesaurus and this was carried out following 
strictly Inductive methods. After Inventorying, seven thousand suggested descriptors were selected 
and circulated to all the participants in the project with the questlonsj 

- Whether the suggested descriptors were used by that institution, and 
• Whether the suggested descriptors should or should not be Included. 

In this way a general and common collection of descriptors was formed, and the Macro* 
thesaurus is therefore a thesaurus of macro*descriptors which are so general as to be useful to all 
the participants. The project was limited mainly to aspects of economic development work. 

The Eudised Thesaurus 

The Eudised Thesaurus (12), which is being developed by the Council of Europe for use 
in the exchange between member countries of educational information, is being compiled on a 
different principle from that underlying the macrothesaurus. 

The Eudised Thesaurus leaves the national educational thesauri of each participating 
country intact in principle and is an attempt to gain a multi«'lingual entry to the existing 
educational thesauri. The multl*lingual aspect of the Eudised Thesaurus is a fundamenul 
feature of It and does away with the objections to the use of one common ot carrier language. 

A carrier language means double work for other language areas when assigning descriptors. 
In the analysis of documents the cartler language also tends to dominate the user language, 
which inevitably leads to loss of Information, 

The subject of education Is well known for its strongly national charactetistici * Its 
Organisation Is*ofteri determined by history, add educational ideals are closely linked to the 
cultural background of the nation. This produces i situation in which national educational 
iheiaurl contain desctlptori for which there is no equivalent In another language, 



The EudUed Theiaufuj might be described ai a national thesaurui In the firit place^ 
with a common link mechanUm for the purpose of internailonal conjultatlon and exchange 
of information. 

The Eudlsed Thesaurus will therefore consUt of at least the following two main sectlonsi 

- The systematic synopsis of the descriptor groups for which equivalents exUt in the 
languages of all the participants in the project \ 

- The alphabetical, cUsslfied list of all 4e descriptors In one Unguage, 

The chief problem facing the Eudised Thesaurus is thus the compilation of the first 

section, 

l!]gJMYyii ^^^y of Thesauri 

Both the Macrothesauius and the EudUed Thesaurus can claim a cerUin generaUty. It 
has not, however, l>:ien proved that the construction methods followed will lead to universaUty 
if extended. 

The classification school of thought malnuins that it is not possible to arrive at a universal 
thesaurus by way of the construction of the thesaurus. 

As we know, the relationships between the descriptor* in a thesaurus are arbitrarily fixed. 
True universality can only be achieved by obeying rules which apply generally • The nature of 
the thesaurus Itself makes It impossible to make any such rules* The tenitory of a thesaurus may 
be either dtscipUne-oriented or mission^oriented. 

If the territory Ls dlsclpline-orlentcd, It would not appear possible for it to go beyond the 
boundaries of that subject of its own accord . On the other hand, in the thesaurus which U miision- 
orlented there is scope for a multl-disclpUnaiy approach. However, the ordering of the poly 
hierarchy thus obtained wouW seem to be reserved for a classlflcatipn system. 

A urAvetsal superstructure In the nature of classification may well be the solution to the 
greater universality of a thesaurus. And thU is the direction in which the ihesaurofacet(i) seems 
to be pointing. 

The UNESCO UNISIST-^BSO project presents similar problems. 
The I'hesaurofacet 

The Thesaurofacet U a good example of how the cUsslficatlpn system and the thesaurus 
may be inttpated, being as It U both thesaurus and a facet classlflcaUon system for all engineering 
and related subjects. 

The Tliesaurofacet covers the whole field of the natural iclences and technology and cotisUti 
of two complementary parts, a classified section and a thesaurus. 

The eiasstficaaon groups the main subjecu in reUtion to each other, in accordance with 
th^ traditional rules of classification. Within each subject, the ideas are arranged in facets j within 
the facets, In hierarchies and series, with & letter and number noutlon. 
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The thesaurus section Is arranged according to the usual pattern, BTs and NTs refer 
to hierarchically broader or harrower terms which do not appear In the part of the classified 
section which coincides with the descriptor. Non-hierarchic relationships are Indicated by 
^related terms". 

There are a great many categories of non*hlerarchic relationships which frequently 
appear In totally different subject areas. Relationships of this kind are not at all easy to 
classify for convenient reference, but of course they appear in the thesaurus section. The 
latter section in the Tl^sauiofacet to some extent replaces the alphabetical index usually 
appended to a conventional system of classification. 

The information which is given In the thesaiuus section to go with the descriptor is 
complementary to the information given by the classified section. Although the Thesaurofacet 
is primarily constructed as a controlled language, with descriptors for use In controlled Indexing 
and searching, the Integration of classification and thesaurus gives the system a much more 
universal character. 

UNESCO's BSO/SRC Project 

UNESCO's BSO/SRC Ptoject may serve as a final example to illusuate the relationships 
between thesauri and classification systems. 

In the Unlslst Project, Involving a woild-embraclng network of Information systems, the 
need became evident for a switching mechanism which would link up the Individual information 
systems, each with their index language (thesauri, classification, keywords, etc.). 

The designing of the mechanism was entrusted to the FID. The project was described as 
follows during the course of the 1972 congress in Budapest: 

"The need for the establishment of a broad classification scheme or a Broad System 
of Ordering (BSO) In the framework of the Unlsist programme arises naturally as one 
of the aspects in the process of achieving compatibility between existing and future 
information systems. Having In mind the great diversity of classification schemes 
used In the world, it can be stated that ^^e main function of BSO 1$ to serve as a 
switching mechanism to link different individual classifications and thesauri, 
in the process of Information- transfer. " 

An FID working group on classification research defines classification as followst 

^'By classification Is meant any method for recognising relations, generic or other, 
between items of Information regardless of the degree of hierarchy used and of 
whether these methods are applied in connection with traditional or computerised 
Information systems.** 

A special working group of Uie FID was charged with the buUdlng up of a Subject-Field 
Refefertce Code (SRQ which would provide Onlslst with such a ftroad Scheme of Ordering, the 
Working group decided to publish the resulu of their work in three p^tux a scheme of 
classification with noutions^ a thesaurus with synbnyms, and a graph, 
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CONCLUSIONS 

The rather stormy passage the documentary thesaurus has had on the way to becoming 
an efficient instrument In Hbrarianship and documentation, and new Ideas on the basic principles 
of classification, may be an indication that thesauri and classification schemes will continue to 
exercise an influence on each other « , . 

The method of post-coordination makes it obllgaiory to have full permutation of the 
descriptors on file. The computer makes this possible and it looks as If this will lead to a more 
intensive study of the properties of documentary index languages. In constructing a thesaurus 
one is faced with the problem of achieving universality. The Thesaurofacet provides the first 
solution to the [>roblem of how the thesaurus and the classification scheme may be integrated. 
And, finally, the Eudised Thesaurus is a first attempt to find a rational solution to the problem 
of multi-lingual educational documentation. 
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APPENDIX n 

An example of a circle graph 




